The activities of teaching, learning and examining
in higher education have received remarkably 
little attention from experimentalists. The 
suggestion that belief, however hallowed, stands in 
need of empirical support is often met with 
incredulous opposition, and men whose academic 
work is based upon a rigorous testing of accepted 
ideas, tend to think it somehow indecent to apply 
the same standard of inquiry to their own 
teaching and examining practices. Few university
teachers are even aware that many of their instructional
problems have already been investigated
experimentally, and only a tiny minority take 
the trouble to acquaint themselves with the results. 

J. P. Powell
“Experimentation and Teaching in Higher Education,”
Educational Research, 6 (1964), p. 179.
Foreword



This is a polemical tract in the best sense of the term. We feel that 
the proper use of scientific knowledge is (1) to be the foundation for 
future additions to knowledge, and (2) to be the knowledge base for 
policy in the practical affairs of men. 

We do not hesitate to demand that future research on college teaching 
methods be guided by new theory. The old ideas about pedagogy at the 
college level are simply wrong. We have assembled in this monograph 
the data base to make this conclusion incontrovertible. We, therefore, 
adopt the stance that it is our responsibility to challenge researchers on 
comparative college teaching methods to start anew from better theory. 
Any further replication of the four decades of research here reanalyzed 
has an almost zero probability of changing the conclusions that speak so 
loudly from these data. 

We are equally unhesitant in suggesting that policy-makers who decide
about college teaching methods either use their prejudices as a basis for
decision (which will produce policy no better than that grounded on other
people’s prejudices), or feel free to determine policy decisions on grounds
other than allegations about pedagogy and learning (e.g. cost, space,
time, convenience criteria).

So, we are polemical in charging our research colleagues to make a 
marked shift in the strategy of doing their research on pedagogy; and in 
urging academic policy-makers to broaden the grounds for their decisions 
on college teaching methods. 

We think our conclusions have the special merit of being
data-based. We are not polemical by arguing a preferred position against
all alternatives. We turned to a voluminous research literature on
comparative college teaching methods and asked: “How do the facts add up?”
The answers are astonishingly clear (indeed almost unique among
behavioral science data in this respect): We cannot claim superiority for
any among different teaching methods used to convey subject content to
the student.

Robert Dubin 
Thomas C. Taveggia 



September, 1968 

University of Oregon, Eugene, Oregon 
CHAPTER 1

The Issues Posed



It is an important part of the folklore of college teaching that certain 
methods of instruction are preferred over others. The most preferred 
methods probably vary by discipline as well as temperament of the
professor. Just as important as having preferences is the supporting belief or
belief-system that alleges superiority for the preferred teaching method. 

We, therefore, enter into an area of strong affect and little concern for 
facts in trying to assess the relative utility of various college teaching 
methods. Any conclusions we may draw from the facts comparing the
consequences of various college teaching methods are likely to be
challenged to the extent that they violate the belief-system of a given group
of professors or college administrators. The result may very well be that
we will be forced to argue about the viability and desirability of given
belief-systems and not data, or else we will have to acknowledge defeat
in the face of the reader’s conclusion: “Don’t bother me with the facts,
my mind is already made up.”

These are times of vast change in educational performance, philosophy, 
and goals. It is a desirable assumption, however shaky its accuracy, to 
expect that the choice of changed directions will be guided by a significant 
knowledge base. Insofar as college teaching methods may be subject to 
change, we would hold that the direction the change takes should be 
guided by knowledge about the comparative advantage of a given method 
of instruction. To this end we have marshaled all the available data from 
comparative studies of college teaching methods. 

We have adopted a fundamental procedure, practiced by the senior 
author, of adding up the data of a field, rather than the conclusions of 
researchers who have marshaled the data. In this respect we have made 
a unique contribution, for all the previous summaries of research in the 
area of comparative college teaching methods have summarized authors’ 
conclusions, and have not gone back to the basic data from which these 
conclusions derive. 

In cumulating the data from studies of comparative college teaching 
methods we are following the time-honored procedures of the natural 
sciences where it has long been the practice to add up the data on a subject 
as a basis for testing models of the phenomenon, or to build new models 
on the basis of the accumulated data.
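
To illustrate what adding up the data, rather than the authors' conclusions, can look like in practice, here is a minimal sketch in Python. It is not the procedure used in this monograph: the sign test, the function name `sign_test`, and the example differences are all assumptions chosen for illustration. Each input number stands for one hypothetical study's difference in group mean examination scores (method A minus method B).

```python
from math import comb


def sign_test(differences):
    """Two-sided sign test on per-study mean differences (A minus B).

    Hypothetical helper: counts how often each method came out ahead and
    asks how surprising that split would be if neither method were better.
    """
    wins_a = sum(1 for d in differences if d > 0)
    wins_b = sum(1 for d in differences if d < 0)
    n = wins_a + wins_b  # studies with a zero difference are dropped
    if n == 0:
        return wins_a, wins_b, 1.0
    k = min(wins_a, wins_b)
    # Probability of a split at least this lopsided under a fair coin.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    p_value = min(1.0, 2 * tail)
    return wins_a, wins_b, p_value


# Made-up differences from seven imaginary studies, for illustration only.
example = [1.2, -0.5, 0.3, -0.8, 0.0, 0.4, -0.1]
wins_a, wins_b, p_value = sign_test(example)
print(f"A ahead in {wins_a} studies, B ahead in {wins_b}, p = {p_value:.3f}")
```

The point of the sketch is that the tally is made on the reported data of each study, so that a reviewer does not depend on how each author chose to interpret his own results.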



Teaching versus Learning 

Analytically we want to measure the utility of one college teaching 
method over another. We believe that teaching is a technology, the
content of which can be rationally ordered by some distinctive model of the
teaching process. Thus, the lecture method assumes the superior knowl- 
edge of the lecturer and therefore places in his hands the selection of 
subject matter to be covered, the depth of coverage to be employed, the 
balance between content and illustration, the length of the lecture period, 
and the frequency of the lectures in a given period of time. The authori- 
tarian image of the lecturer which derives from this model of the lecture 
teaching technology does not influence in any way the consistency of the 
conclusions which derive from the assumption of superior knowledge and 
information on the part of the lecturer. By way of contrast, self-study 
(this usually means a reading list and a limited period of time to com- 
plete the reading) limits the superiority of the instructor to a knowledge 
of the relevant bibliography of his field. Beyond that it is assumed that 
the student learns through interaction with printed materials. The book 
rather than the instructor becomes the teacher. 

It seems reasonable to assume that given such distinctive teaching 
technologies as lecture, on the one hand, and self-study, on the other hand, 
there should be measurable differences in outcomes of these two methods. 
It is the very reasonableness of such an expectation that leads to the 
conclusion that there will, indeed, be measurable differences between 
any two contrasting teaching methods. 

Underlying all belief-systems in the efficacy of one teaching method 
over another is an implicit model of how teaching and learning are
linked. It is invariably assumed that when a teacher is teaching a student, 
the student is learning because of the teacher’s intervention. This assumed 
linkage between teaching and learning is the morass in which much of 
the controversy about various teaching methods bogs down. For while 
the apparent focus of discussion is upon the alleged advantage of one 
teaching method over another, the outcome measured is usually declared 
to be some aspect of student learning. If it can be shown that learning of 
content is not differentiated between two or more teaching methods, then 
it may be alleged that: “The student learns a point of view;” “The student 
learns to think;” “The student learns sensitivity to others;” “The student 
achieves inspiration;” “The student develops a sense of relevance;” or 
countless other variations on these themes, all of which center on allega- 
tions regarding what the student learns most effectively from a preferred 
teaching method. 

It has only been in the current decade that recognition has grown 
apace that we really do not know what the linkage is between teaching 
and learning. Gage and Hilgard have been leaders in alerting the school- 
men to the need for establishing the clear and unequivocal links between 
a theory of learning and a theory of teaching. We subscribe wholeheart- 
edly to this position. This monograph will have made a significant con- 
tribution if it does nothing more than fortify the conclusion that we have 
not yet established adequate theories of the linkages between teaching 
and learning. 



Measuring the Results of Teaching 

The comparative study of college teaching methods has largely focused 
attention on measuring the results of college teaching through a final 
examination given to the students. Examinations of this sort are typically 
content-oriented, and are designed to determine how much of the content 
presented by the teacher can be recalled, after some delay, during the 
final examination of the student. 

The problem with which we are confronted is simply stated. Does the 
final examination in a course measure the learning of the student? Does 
the final examination in a course measure the teaching methods of the 
teacher? Or does the final examination in a course measure some com- 
bination of learning and teaching? We are in no position to give a positive 
answer to any of these questions. 

Most of the time we will be measuring whatever happened by the end 
of a course, utilizing a final examination and the grade assigned to it. 
In rare cases we will have an opportunity to measure gains made by a
student, from some measured level of knowledge before taking a course, 
to the measured level of knowledge determined by examination at the 
end of a course of instruction. In a few instances we have studies that 
evaluate other outcomes measured on the student body. 

We must, therefore, start with the difficult issue of deciding what, if 
anything, is being measured by the outcomes employed in the numerous 
studies of comparative college teaching methods. The easiest way to con- 
ceptualize this is to employ the familiar black box analogy. The inputs 
into a black box labeled “teaching-learning” situation are two or more 
distinctive teaching technologies covering the same subject matter and 



THE "TEACHING-LEARNING" BLACK BOX 



INPUTS 

Teaching method 
A 



Teaching method 
B 






OUTPUTS 

Student's Exam 
Performance— A 



Student's Exam 
Performance— B 



Teaching- 

Learning 

Black 

Box 



subject matter area. Whatever happened inside the black box, at the end 
of a prescribed period of tune known as the length of the course, an 
output is measured on the student body by a final examination. Now, 
without making any assumptions about what goes on inside the teaching- 
learning black box, we may simply conclude that one group of students 
subject to teaching method A, when compared with an equivalent group 
of students subject to teaching method B, will produce either the same 
output or a distinctly different output in the form of group mean examin- 
ation scores. 

If the measured outputs differ by a statistically significant amount we 
may then make one of two assumptions: (a) that the differential inputs 
represented by teaching methods are related to the outputs from the teach- 
ing-learning situation; or (b) that the measuring instrument known as 
a final examination is so unstable as to produce random differences of 
a magnitude greater than zero. 
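
The comparison described above can be sketched in code. The following Python example is an illustration only, assuming a permutation test as the significance test (the studies surveyed used their own tests, which are not specified here); the function name `permutation_test`, its parameters, and the scores are hypothetical.

```python
import random


def group_mean(scores):
    """Arithmetic mean of a group's examination scores."""
    return sum(scores) / len(scores)


def permutation_test(scores_a, scores_b, n_resamples=5000, seed=0):
    """Two-sided permutation test for a difference in group mean scores.

    Hypothetical helper: if the teaching method made no difference, the
    labels A and B are arbitrary, so we reshuffle them and count how often
    a difference at least as large as the observed one arises by chance.
    """
    rng = random.Random(seed)
    observed = group_mean(scores_a) - group_mean(scores_b)
    pooled = list(scores_a) + list(scores_b)
    n_a = len(scores_a)
    at_least_as_extreme = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)
        diff = group_mean(pooled[:n_a]) - group_mean(pooled[n_a:])
        # Small tolerance guards against floating-point rounding.
        if abs(diff) >= abs(observed) - 1e-12:
            at_least_as_extreme += 1
    # Add-one correction keeps the estimated p-value above zero.
    p_value = (at_least_as_extreme + 1) / (n_resamples + 1)
    return observed, p_value


# Made-up final examination scores for two equivalent groups.
method_a = [72, 85, 78, 90, 66, 81, 77, 88]
method_b = [70, 83, 80, 86, 69, 79, 75, 91]
difference, p_value = permutation_test(method_a, method_b)
print(f"mean difference (A - B): {difference:.2f}, p-value: {p_value:.3f}")
```

A small p-value corresponds to a statistically significant difference in outputs. Whether that difference is then ascribed to the teaching methods, as in assumption (a), or to an unstable examination, as in assumption (b), is a matter of interpretation that the test itself cannot settle.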

If we make the second assumption we then focus on the measurement 
of outcome and contend that examinations are so inadequate that their
vagaries are sufficient to account for major differences in measured group
outputs. This assumption about the imperfections of examinations has
significant implications for the administration of students in colleges and
universities. It calls into question the entire student grading system, the
system for advancement to a higher level, the basis for making
scholarship and fellowship awards, and the retention of students for a normal
academic cycle. Thus, to accept the second assumption is to call into
question far more than merely the findings on comparative college
teaching methods. We are not, however, asserting that the high degree to
which final examinations have become institutionalized is sufficient to
justify their existence, or their utilization to measure anything. We merely
call your attention to some of the consequences of discrediting final
examinations as measuring anything accurately.

We start with the first assumption: that the differential inputs repre- 
sented by two or more teaching methods are related to the outputs from 
the teaching-learning situation. We do believe that the examinations used 
to measure the output of the teaching-learning black box are sufficiently 
standardized in the given comparative situation that any difference 
measured by this method, if statistically significant, may not be ascribed
to the variability in the measuring instrument itself. Indeed, in most of
the comparative studies, the same final examination is used for the groups
compared, so that the results are influenced equally by the character and
quality of the examinations used to measure student performance.

If we look at the bulk of our findings and discover, as we will, that 
there are relatively few significant differences among the various teaching 
methods as measured by examinations, then one might argue,
ingeniously, that whatever the examinations measure they do not measure the
distinctive and unique inputs of compared teaching methods. This,
of course, has been the principal argument of those who profess surprise
whenever the teaching method they prefer does not turn out to be superior
to one with which it is compared in a given study. This line of argument
may be very significant and it is indeed one of the principal conclusions
to which we come in Chapter 4. There we will point out as one of several
recommendations that any strategy of continued studies of comparative
college teaching methods designed to produce a significant payoff must
proceed to examine outcomes of the teaching-learning black box other
than student performance on final examinations. We will not argue,
however, that because we find no differences between two teaching methods
there must be such differences if we will only search hard enough for
them. We prefer to leave the issue of whether differences really exist to
future research for resolution.

The Teaching-Learning Black Box 

The fundamental conclusion of this study is that we can no longer 
afford to consider the teaching-learning black box as an unexplored realm. 
As already pointed out, Gage and Hilgard along with others have, during 
the current decade, raised questions about the appropriateness of ignoring 
the linkage between teaching and learning. We would now add to this 
that the state-of-the-art analysis represented by this monograph leads to 
an unequivocal conclusion that the most important analytical problem 
faced is to explore the content of the teaching-learning black box. 

We can no longer assume either a traditional or a comfortable belief 
about the linkages between teaching and learning. We simply do not 
know what these linkages are and it is high time that we explore them. 

The urgency of the need to explore the teaching-learning connection 
is emphasized by recalling to you that all the studies here analyzed deal 
with adult learning. We have limited our attention to a population of 
college-level students. These are typically 18 years of age and older and 
the average age is well above 20. There is a high probability that theories 
or models of teaching-learning appropriate to the young may not be 
appropriate to near-adults or adults. 

For one thing it is very clear that college students have an experience 
base as well as a volume of learned knowledge that far exceeds that of 
youngsters. This must surely have an impact upon the kind of teaching 
that would be particularly appropriate to learning at that age and ex- 
perience level. 

The college student is typically an active participant in the selection 
of subject areas of his own learning. This selection process is enforced 
by the institutional requirements of declaring a major and pursuing 
prescribed courses of instruction to fulfill the major and/or minor re- 
quirements for a degree. However minimal the amount of voluntarism on 
the part of the student in choosing his areas of study, the fact that there 
is some choice must certainly have something to do with the linkage be- 
tween teaching and learning at the college level. Indeed it begins to explain 
the contemporary cry and demand for relevance that is abroad in the 
land today among college students critical of the course offerings and 
teaching methods in their institutions of higher learning. Especially under 
conditions of strong occupational motivation the sophisticated college 
student of this generation may demand courses of instruction which his 
own professors are less than capable of offering. An underlying reason
for the generation gap may be the fact that the students, who are supposed 
to be learning, may know different things, perhaps know them better, 
than the professors who are expected to do the teaching. 

The choice behavior exercised by the student in selecting the subjects 
for his own instruction, and the knowledge base from which he is likely 
to both make this choice and evaluate the quality of instruction received, 
may be critical features of the student’s participation in the teaching- 
learning process. At the college level we may have to model the teaching- 
learning link in a fashion quite different from the models of immediate 
post-natal stimulus-learning, or first grade teaching-learning, or even 
high school learning-teaching. Indeed, our own conclusions from this 
survey that independent study is as effective as any face-to-face instruc- 
tion, when measured by student examination performance, should make 
clear that a great deal of learning resides in the student. 

It is important to add a third dimension to the teaching-learning 
situation by examining the student contribution, and especially student 
motivation to learn. Much of the thinking about student motivation at 
pre-school and grade school levels does not seem to make much sense for 
college-age students. Those teaching techniques that derive directly from 
these preconceptions about student motivation at these early levels would, 
therefore, seem to be singularly inappropriate for college instruction. It is 
notable that the concepts of the “mature personality” capable of “self- 
realization” have built into them the notions of individual independence, 
self-discipline, and willingness to choose among goals and their means 
of attainment. Clearly, motivation to learn affects the balance of inputs 
into the teaching-learning black box among college-age students in the
direction of increasing the amount of learning activity relative to the 
amount of teaching activity. 

At least three factors characterize the teaching-learning situation at
the college level: (1) voluntarism on the part of the student in choosing
the subjects of instruction; (2) a knowledge base possessed by the stu- 
dent for making judgments about the content and quality of instruction 
received, judgments which, in turn, influence the voluntary choices made; 
and (3) the complex of culturally derived expectations and behaviors 
which comprise what we loosely summarize as the motivation to learn. 
Thus, a more accurate characterization of the input situation for the
teaching-learning black box might look like the illustration below.

[Figure: The Teaching-Learning Black Box with two sets of inputs. Instructor inputs: subject knowledge, judgment, analytical skills, etc. Student inputs: voluntarism, knowledge base, motivation (adult), etc.]

It is not our purpose in this state-of-the-art survey to develop a model
of teaching-learning at the adult or near-adult levels. We do, however,
believe that anyone working or doing research at the college level of
instruction can most readily make useful contributions if this linkage
between teaching and learning becomes the center of their attention.

The evidence from this analysis conclusively demonstrates that differ- 
ences among teaching methods occur only rarely as measured by a final 
examination. It seems safe to predict that any future studies replicating 
the four decades of research here summarized can only reproduce the 
results presented here. We have absolutely no reason to believe that
tomorrow’s studies, if they merely replicate what was done yesterday, will
produce any results that differ to any significant degree. 

The evidence is all in upon which we may base our conclusions about 
the relative utility of given methods of college teaching, when this utility 
is measured through final examinations: THERE ARE NO DIFFERENCES 
THAT AMOUNT TO ANYTHING. 



The "Other Results" Conclusions 

When repeated studies of comparative college teaching methods have 
revealed no differences between two or more methods, the researcher who 
anticipated results favoring his own preferred method is likely to allege 
that his preferred teaching technology produced “other results” that 
could be observed but not necessarily measured in the studies. Or, failing 
to observe these “other results,” the researcher might piously conclude 
that had they been under observation the “other results” would have 
been measurable. 
This is obviously a very constructive position to take for it assumes
automatically that the measured outcome, student performance on final 
examinations, does not really measure or differentiate between the varied 
inputs of two or more teaching methods. Thus the claim for “other results” 
turns out to be a disguised form of plea for an adequate theory of teach- 
ing-learning in which the output variables are more imaginatively defined 
than by the final examination grade. The disappointing aspect of the 
research literature is that the promise of building a viable model of the 
teaching-learning process contained in the belief that “other results” 
were achieved by various preferred methods of instruction has never 
been realized.¹

We are here involved in the fundamental relation between fact and 
ideology. A professor who believes that discussion or a tutorial is most 
productive of a learning consequence takes comfort in the fact that no 
research has disproved this conclusion, but he scarcely finds the support 
for his ideological position in any empirical data. The administrator who 
decides from a cost standpoint it is more efficient to use large lecture 
classes can also point to the research literature and say, “My position is 
not disproved.” He cannot, however, just as his professorial antagonist 
cannot, claim that his conclusion is supported by the data. In this incon- 
clusive situation the decision-maker’s ideology will continue to dominate 
the operating conclusions by which programs of instruction at the college 
and university level are designed. We can only hope that bringing the 
state-of-the-art in college instruction up to date with respect to the research 
literature will make clear to those who choose among college teaching 
methods the ideological grounds for their choices. We also hope it is
now clear that, if research-based policy decisions are to be
favored, the research required to provide grounded policy decisions
is yet to be done in the area of comparative college teaching methods.

Summary— The Problem Posed 

The problem posed by this state-of-the-art review may be set forth 
simply. 

¹ That this failing is not solely characteristic of educationists and educational
researchers is tellingly revealed in the survey of the research literature made by
Campbell and Dunnette in trying to tease out the measured consequences of
sensitivity training. Their results show that none of the claimed advantages of
sensitivity training had ever been measured in real situations and that either there
were no particular advantages ascribable to sensitivity training or the outcomes
measured were not relevant to these advantages. See: John P. Campbell and Marvin D.
Dunnette, “The Effectiveness of T-Group Experiences in Managerial Training and
Development,” pre-publication copy (The University of Minnesota, 1967,
mimeographed).
Given a population of seven million adults or near-adults attending
two- and four-year colleges and universities as students, what can we say 
about the relations among various methods for instructing them and 
the outcomes produced when measured on final examinations in their 
courses? Does the research literature give us a definitive answer to this 
question? 

The results of our intensive reanalysis of data on comparative college 
teaching methods make it very clear that our intended goal has been 
achieved. We are able to state decisively that no particular method of 
college instruction is measurably to be preferred over another, when 
evaluated by student examination performances. We may also conclude 
that replication of the 91 studies examined in detail in this survey would 
not produce conclusions different from ours. 

Any future research on comparative teaching methods at the college 
level must move in new directions. We have suggested that a fruitful 
direction of further analysis will be to examine directly the links between 
teaching and learning for a student group of adults or near-adults. 
CHAPTER 2

The Sound and Fury, Signifying Nothing



Approximately four decades of research are represented in this analysis.
The history of these research efforts began in the period just following
World War I. There is concern throughout the four-decade period with 
using the methods of science to establish the superior utility of one college 
teaching method over another. 

Two interesting themes are evident in this survey of the literature: 

(1) The first and dominant theme is the optimistic assumption that a
“scientific methodology” could be applied to the analysis of comparative
teaching methods. It was assumed that by carefully employing
experimental controls, the contrast between two or more teaching methods could
be objectively set forth and the consequences measured, usually through
examinations. The problem of experimental design was viewed as a very
simple one: to clearly distinguish between two or more inputs and then
to measure whether or not their respective outputs were alike or different.
It did not matter whether the null assumption was made or whether a
predicted difference in a given direction was set forth. In both instances
the experimental design was identical.

(2) The second theme relates to seeking an explanation for the
research findings. As the pile-up of results made clear that the
preponderance of evidence tested showed no differences, even between obviously
distinctive teaching methods, the researchers turned attention to flaws in
the research process to explain the non-conclusive results. If no differences
resulted it was often alleged that this was because the measuring
instruments were too gross to detect the “true” differences that “really” existed.
Under similar circumstances, alternatively, it might be argued that there
was not quite enough difference between one teaching method and another
to produce the desired and expected differences in measured outcomes.

Throughout the four decades of research no one was willing to give up 
the belief that there had to be significant differences in outcomes produced 
by different college teaching methods. It has been only recently that the 
thoughtful researchers have begun to ask whether the fault lay with con- 
ceptualization of the problem rather than with the technologies of research 
employed. This, of course, is the issue we hope will be forced by the 
present monograph. 

We think it is time for the researchers in the field to reconceptualize 
the problem as one of modeling the linkages between teaching and learning
at the adult level. This becomes all the more pertinent because the present 
widespread use of educational TV, and the rapid development of pro- 
grammed instructional methods demand a significant broadening of our 
models of learning to include unique teaching techniques. In television 
instruction the direct face-to-face relationship between instructor and 
student is eliminated. Yet, in both instances, there is still involved a teach- 
ing technology. In another monograph we review the extant studies of 
comparisons between college instruction by television and face-to-face 
methods. We conclude that there is no significant advantage to face-to- 
face instruction when measured against conventional instruction by tele- 
vision and we ascribe the differences which do appear to differences of 
media rather than differences in instructional methods.¹ We may also
predict that as future comparative studies are made between programmed 
instruction and other teaching methods, the same outcomes will obtain. 

The Persistence of Hope 

The question must be raised as to why, in the face of mounting evi- 
dence, it was more than four decades before the initial analytical problem 
was reformulated. Why didn’t the accumulated research speak to each 
new generation so that it would not replicate the same inconclusive studies 
of its ancestors?

¹ See: Robert Dubin and R. Alan Hedley, The Medium May Be Related to the
Message: College Instruction by TV (Eugene, Oregon: CASEA, 1969).
To this question one might simply answer, “When the facts get in the
way of an ideological conviction, the facts must be bent or discarded, but 
the conviction should remain unchanged.” Thus, if college professors are 
convinced that tutorials or discussions are most productive of learning, 
and the research data do not disprove that conviction, it seems fair to 
continue to hold to that belief. The fact that the research findings do not 
prove that tutorials and discussions are superior is taken as irrelevant 
information. It leads to the conclusion: “If I am not proved wrong, then 
I have every reason to believe that I am still right.” But no one else was 
proved wrong either in their espousal of other teaching methods. The net 
effect of the inconclusive data was to permit each person to hold to his 
private preference concerning a given teaching method without the data 
demanding its alteration. 

There must be other reasons for the dominance of ideological justifica- 
tions for particular teaching methods right up to the present time. We 
will examine three influences: (1) the persistence of traditions about 
teaching; (2) the pressures for changes in teaching methods generated 
by rapidly growing enrollment and slowly increasing teaching faculties;
and (3) the influences of ideological developments largely external to the 
institution of higher education. 

The traditions of a profession, like college teaching, run deep and
change slowly. The generation of professors who were trained and teach- 
ing before and right up through World War II had largely experienced 
as students, and later in their teaching careers, small-class, tutorial-like, 
teaching situations. Small classes were central to the basic tradition of 
colleges and even universities. Where large lectures were employed they 
provided the exception in teaching practice in order that many students 
might enjoy the benefits of a brilliant lecturer or renowned figure. It 
seemed justified to conserve the time or the energies of the distinguished 
by permitting their performance in the lecture amphitheater. The situation 
changed after World War II, but the very need to move to larger classes 
and more impersonal relations between faculty and students in the class- 
room could well have confirmed, precisely because it seemed to be an 
exception to the tradition, the continuing preference for the historically 
certified “right” way to teach college students. Therefore, we have reason 
to believe that right up to the present time the modal preference of college 
professors is for the small size class, discussion-type interaction with 
students in a personalistic fashion. 

A significant structural influence on the preference for particular kinds 
of teaching methods was the rate of change in the growth of faculties in 
relation to student bodies. Prior to the end of World War II the supply
of professors grew at a rate comparable to the rate of growth of the student 
body. Under these circumstances the student-teacher ratio remained 
largely unchanged. From 1940 to 1945 the withdrawal of young men 
from the campus to serve in the armed forces actually reduced the student- 
faculty ratio. 

In the post-war era the pressure of a very rapidly increasing rate of stu-
dent enrollment was not matched by the growth of teaching faculty. Not
only did an increasing proportion of professionally trained faculty mem- 
bers devote part or full time to research, but even those who were teaching 
exclusively experienced a decline in hours-per-week teaching load. Com- 
bined, these factors forced enlargement of classes and the utilization of 
teaching methods such as closed circuit television. The mass methods of 
instruction were invariably viewed as compromising “true” educational 
experiences in the interest of expediency in order to handle the increasing 
enrollments. Thus, teaching more students without a comparable increase 
in instructors necessitated a move away from the ideal of small classes 
with a Mr. Chips at the head of the seminar table. The change was viewed 
as a necessary evil and therefore no genuine legitimation of the new 
teaching methods was attempted or achieved. It is not surprising therefore 
that preferences for the “classical” methods of college instruction should 
persist even in the face of demonstrated need for inventing or considering 
alternatives. 

Developments that took place essentially outside the academy but had 
repercussions upon it comprised a third influence upon the ideological 
commitments of American university professors to classical methods of 
college instruction. The post-World War I period was one in which psy- 
chologies gave special emphasis to individual differences and the need 
for designing educational opportunities to emphasize and realize the 
differing potentials of individuals. The progressive education movement 
had its roots in an empirical philosophy of the individual. It seems obvious 
that the informal, student-centered instructional methods would be most 
consistent with a belief in the desirability of maximizing the potential of 
individuals of differing talents and abilities. We may then conclude that 
in the period between the world wars, the philosophic orientation of pro- 
gressive education, grounded in the psychology of individual differences, 
would have its repercussions upon the colleges and universities, at least 
with respect to teaching methods or preference for teaching methods. 

In the 1950’s a similar philosophic movement reached its peak, a move-
ment characterized by a commitment to what came to be called group
dynamics and by a belief that the individual, being the product of group
experiences, achieved his maximum self-realization in small group con- 
texts. Indeed, this philosophic movement remains in current high fashion, 
under the name of sensitivity training. In this philosophic tradition the 
pedagogy most adaptable to maximizing small group experiences is the 
small group itself, with the instructor operating as a resource to the 
experiencing group. 

These successive philosophic traditions, although relatively different 
from each other, both had identical consequences for ideological pref- 
erences of university professors for a given method of teaching. 

By combining these three strands of analysis we may conclude that the 
traditional beliefs about teaching, the relations between demand for teach- 
ing services and their supply, and the impact of philosophic positions 
upon professors’ views of the teaching-learning linkage, all served to re- 
inforce the generally-held belief that discussion and tutorial methods of 
instruction were to be preferred at the university and college level.2

The ideological conviction that tutorial and small group situations were 
most efficacious for college-level teaching led to studies and experiments 
to demonstrate that this was, in fact, a scientifically grounded conclusion. 
The motive for doing the comparative studies was to prove a point already 
well established in the minds of professors. Only when demands could not 
be satisfied within traditional methods were professors willing to move to 
other methods of instruction, including large lecture classes, educational 
TV, and ultimately, programmed instruction. There was no experimental 
evidence that supported the utility of these alternate methods over the 
traditional modes of college instruction. Nor was there, and this should 
be emphasized, any evidence that these alternate modes of instruction 
were any worse than the traditional ones. 

The knowledge base for making innovations in instructional methods 
was neutral with respect to the particular innovations considered. Perhaps 
this very neutrality of the knowledge for making an informed decision 
based upon the measured results of given methods of instruction is the 
key to understanding how, in the face of a very strong and determined 
ideological conviction to the contrary, large classes, impersonal instruc-
tional relationships, and even machine teaching, were able to gain a sig-
nificant place in the repertory of college teaching methods. Put another
way, the rate of social change may be markedly increased by the very lack
of evidence that a given system of behavior is either better or worse than
an alternative.

2 It should be strongly emphasized here that since the data from which we are to
reach our conclusions are based on undergraduate instruction in colleges and uni-
versities, we have not drawn into the argument any conclusions based upon the
principal mode of instruction at the graduate level. This, however, would also have
an impact in exactly the same direction as the three factors we have already examined
because much of graduate instruction approximates an apprenticeship in which the
individual instructor and the individual student have more or less continuous
interaction.

Ideological Roots of Comparative Research 

We here simply present a straightforward recital of what the various 
researchers did, and in many cases why they did it, in the course of making 
comparative analyses of college teaching methods. The account is chrono- 
logical so that the reader may have a sense of the history of ideas that have 
characterized the justification for comparative research. 

We have already pointed out that the progressive education movement 
influenced ideas about college teaching. We might therefore expect that 
some of the early studies of comparative college teaching methods would 
come from courses in education and/or psychology.

In 1924 we had the following study:

During the first semester of the school year 1922-23 an experiment was 
conducted by the authors to determine what differences existed in the 
measurable results of instruction in a large (lecture) and small (discus- 
sion) class in an education course offered in the School of Education of 
the University of Michigan, (p. 1) ... results in terms of semester 
averages indicate no appreciable difference in the achievement of the 
large and small class included in this study, (p. 12) 3 

This is one of the early studies to find that no differences indeed existed. 

The influence of Goodwin Watson at Teachers College is clearly re- 
vealed in the next report. 

The present experiment is one of the attempts which Dr. Goodwin B. 
Watson and the author have made to secure better results from the 
teaching of Educational Psychology. The lecture method did not seem 
to recommend itself as wholly desirable and yet, as it is such a common 
type of teaching it certainly was desirable to compare other plans with 
it. The experiment, therefore, set out to compare the lecture method 
with a plan involving class discussion, (p. 454) 

...we can only conclude that for two large (N=120-170) sections of 
graduate students in Educational Psychology, meeting once a week for 
two hours, with little previous experience or feeling in favor of the 
discussion method, the lecture method is superior to the discussion 
method in producing improvement in things measured by tests, (p. 461) 
The problem that must be solved is not the question, “Is Method A
better than Method B or Method C?” but rather, “What are the conditions
under which Method A produces most effective results? What are the
situations where Method B is best?, etc.” There are times when the
lecture method is an effective aid in securing desirable changes in
students. There are other times when the lecture method is harmful and
should be replaced by discussion or individual projects or something
else. (p. 462) 4

3 J. B. Edmundson and F. J. Mulder, “Size of class as a factor in university instruc-
tion,” Journal of Educational Research, 9 (1924), pp. 1-12.

By this time it had already become apparent that the analytical problem
was not the old industrial management problem a la Taylor of discovering 
the “one best method” but rather asking the question, does each method 
employed have its unique advantages? It is notable that this question 
persists through the literature and always remains unanswered in terms 
of empirical data. Spence had no data on which to base his final conclu- 
sion, nor do succeeding authors who insist that each method has its unique 
advantages. 

From this point on there is constant surprise expressed because the 
expected differences in favor of traditional teaching methods do not 
emerge. The following study reports an analysis of the teaching of English 
History: 

This study was planned and carried out with a view to throwing some 
light on the question of the value of quiz sections (discussion methods) 
in the teaching of history. Two sections of English History were given 
during the year of 1927-28, one meeting three times a week for lectures 
only, the other meeting three times a week for lectures and once a week 
in small groups, (p. 276) 

... we are led to a conclusion contrary to all expectations — namely, 
that the added hour a week devoted to quiz sections had no significant 
value, (p.282) 5 

In the depths of the depression the Department of Psychology at the 
University of Minnesota determined to investigate the relative utility of 
lecture-quiz and all-lecture methods of instruction. For the introductory 
course the following results obtained: 

In view of the available experimental evidence and the increasing prob- 
lems brought about by quiz sections, the Department of Psychology 
decided to conduct a controlled investigation to ascertain, if possible, 
the relative merits of the lecture-quiz and all-lecture method, (p. 33)
From the . . . findings we may conclude that under the conditions sur-
rounding this experiment, there is no difference in the value of the
two methods employed, (p. 48) 6

4 R. B. Spence, “Lecture and class discussion in teaching educational psychology,”
Journal of Educational Psychology, 19 (1928), pp. 454-462.

5 D. G. Barnes and H. R. Douglass, “The value of extra quiz sections in the
teaching of history,” U. of Oregon Publications, Educational Series I: 1929, 1, No.
1, pp. 276-284.



In this study Longstaff reviewed the literature up to 1932. In 1942 and
in 1955 the distinguished psychologists Dale Wolfle, and Birney and
McKeachie, reviewed the research literature on comparative college teach-
ing methods and concluded that Longstaff’s 1932 review still accurately
described the results of research. As Birney and McKeachie say:



Teaching methods: summary. In 1942, Wolfle summarized research up
to that time by repeating Longstaff’s statement of 1932: “The experi-
mental evidence submitted to the present time tends to support the
general conclusion that there is little difference in achievement in large
and small classes and, also, that it makes little difference as to what
method of presentation of the materials of the course is used.” [D.
Wolfle, “The first course in psychology,” Psychological Bulletin, 39, 
(1942), pp. 685-712] The third decade of research has not outdated 
Longstaff’s statement. However, recent research does hold forth the prom- 
ise that in the next decade we will have a better understanding of the 
effect of various teaching methods on student learning. 7 



Thus we have in neat historical order three decades of research reviewed 
with the clear-cut conclusion that no demonstrated differences can be 
found in the research literature with respect to college teaching methods. 
But the final reviewer in 1955 still maintains an optimistic expectation 
that such differences eventually will emerge from the future research. 

By 1940 the research results were being clearly incorporated into the 
textbooks on college teaching. For example, Cole reached the following 
conclusion after summarizing ten studies (there being many more already
in the field):



In so far as mere mastery of elementary subject matter is concerned, the 
lecture method is as good as any other. The results are sometimes almost 
identical for the lectures and discussion groups, sometimes slightly in 
favor of the discussion classes, and often appreciably in favor of the
lecture classes. 8



In the post-World War II period we can see emerging the influence of 
the group dynamics movement in giving direction to the formulation of 
research problems. Thus, for example, the term non-directive tends to be
used in place of “discussion method” or “tutorial.” The following is a
typical description of the new direction for research on college teaching.

6 H. P. Longstaff, “Analysis of some factors conditioning learning in general
psychology,” Journal of Applied Psychology, 16, (1932), pp. 9-48; 131-166.

7 R. Birney and W. McKeachie, “The teaching of psychology: A survey of research
since 1942,” Psychological Bulletin, 52, No. 1, (1955) p. 58.

8 L. Cole, The Background for College Teaching (New York: Farrar & Rinehart,
1940), pp. 324-325.

The aim of this study was to evaluate the over-all effectiveness of non- 
directive teaching of an undergraduate course in general psychology 
as compared to the traditional lecture-discussion, (p. 19) 

The results of this phase of the experiment indicate that non-directive 
teaching, as outlined in this study, is not as effective as the traditional 
teaching techniques in helping students master the factual subject mat- 
ter of a course in general psychology as measured by an examination 
based on knowledge of textbook material. 9 

Given the reality of large lecture classes, especially in introductory 
courses, studies emerge in which, perhaps to salve the conscience, it is 
attempted to find out whether the continuing belief in small intimate 
classes is justified. Thus we have the following report on the introductory 
course in psychology. 

The present investigation was originally undertaken as a local report to 
the department and the administration to see how well or how poorly 
students exposed ... to impersonal, mass, education might do. (p. 298) 
The main results were as follows: The lecture sections averaged for the 
six quarters three points higher than the small classes, with superiority 
appearing five quarters out of six, and the sixth coming out a tie. In no 
case did the smaller group earn a higher course total, (p. 299) 

In spite of all . . . arguments in favor of large lecture sections, which are 
largely based on statistics on grade-point earnings, the writer must admit 
that he is not 100 per cent convinced of the efficacy of such “mass educa- 
tion.” (p. 300) 10 

What is, of course, interesting is that the writer refuses to be “100 per 
cent convinced of the efficacy of such ‘mass education.’ ” The point is not 
that the superiority of the large class has been demonstrated since the 
differences are not statistically significant as reported in this study. What 
the writer is still not willing to admit is that the presumed superiority of 
the small discussion class is also not proved by this or preceding studies. 

The following is an illustration of how the language changes but the 
problem remains the same. 

This study limited itself to the investigation of two styles of teaching, 
operationally defined below as “directive” and “permissive.” 

Summary and Conclusions: The D and P groups showed no significant
differences on the objective-final when taken as a whole, but when the
two teaching methods were analyzed for their effects on the “better stu-
dents” and the “poorer students” it was demonstrated that the directive
sections were more beneficial to the poorer student, (p. 184) 11

9 M. J. Asch, “Non-directive teaching in psychology: An experimental study,”
Psychological Monographs, 45, No. 4, (1951), pp. 1-24.

10 Richard W. Husband, “A statistical comparison of the efficacy of large lecture
versus smaller recitation sections upon achievement in general psychology,” Journal
of Psychology, 31, (1951), pp. 297-300.

In Wispe’s study the comparison is now between “directive” and “per- 
missive” teaching styles. The conclusion reached is the standard one of 
“no difference.” But there is a suggestion that when student bodies are 
differentiated according to their ability, the “directive” method may be 
superior. 

By 1954, Guetzkow, Kelly, and McKeachie experimentally studied 
three distinctive methods of college teaching. Their description of the 
research and their conclusions are self-explanatory. 

. . . the design may be stated in the null form: There will be no significant
differences in outcome with respect to course objectives between students 
whose class meetings are taught by Method 1 (recitation-drill) and those 
students who are taught by Method 2 (group-discussion) or by Method 
3 (tutorials), (p. 195) 

By and large, we found no differences between the three teaching methods 
from the point of view of educational outcome. The few statistical differ-
ences, in general, favored the recitation-drill method. The results of the
experiment, of course, were contrary to our original expectations. They 
constitute a clear confirmation of the general conclusion derived from 
experiments on instructional procedures since the early 20’s. As Good 
recently put it, after reviewing the literature in this area, “The com- 
plexity of the teaching-learning process is such that attempts to establish 
the relative merit of a ‘general method’ of teaching are likely to prove 
inconclusive.” [C. V. Good, “Colleges and universities: VIII. Methods 
of teaching,” in W. S. Monroe (Ed.), Encyclopedia of Educational Re- 
search, New York: Macmillan, 1952, pp. 273-278.] (pp. 205-206) 12 











Thirty years after the first study cited here we are still getting the same 
conclusions. Guetzkow, Kelly and McKeachie at least maintained the pos- 
ture of optimism that must surely sustain researchers after thirty years 
of failure to demonstrate what they expect to find. 

As if to recognize that there is indeed an ideological commitment to 
given methods of instruction, irrespective of the research findings,
McKeachie followed up the work he did with Guetzkow and Kelly in the 
same year and reached the following conclusions. 

One would expect that the controversy between our education’s author-
itarians and permissivists would long ago have been resolved by the
cold logic of experimental studies. Unfortunately, this just hasn’t hap-
pened. The published experimental studies are not in agreement and
there are a host of unpublished studies which remain unpublished because
the two methods used produced no significant differences in outcomes.
(p. 146)

What are we to conclude from these studies? While there is a dearth of
follow-up data, with such slender results at the end of the courses our
hope that either method produces significantly greater long-time benefits
is probably unrealistic. Should everyone go his own way and teach any
way he pleases? Personally, we are not willing to go quite so far, but cer-
tainly none of us should exclaim with horror, “His classes are instructor-
centered.” As psychologists, however, we believe in research. Why has
research on student-centered versus instructor-centered teaching seemed
to lead up a blind alley? (p. 148) 13

11 L. G. Wispe, “Evaluating section teaching methods in the introductory course,”
Journal of Educational Research, 45, (1951), pp. 164-186.

12 H. Guetzkow, L. E. Kelly, and W. J. McKeachie, “An experimental comparison
of recitation, discussion, and tutorial methods in college teaching,” Journal of Educa-
tional Psychology, 45, (1954) pp. 193-207.

In a 1958 review of 66 research reports the Bureau of Institutional 
Research at the University of Minnesota reached the following conclusion. 

Undoubtedly the most striking finding of this review is the consistent 
inability of investigators to demonstrate statistically significant differ- 
ences between the experimental and control methods of teaching. This 
seemed to hold true regardless of the subject field or class size being 
examined. In the few studies that did suggest significant findings the 
differences were generally small and of doubtful practical significance. 
Unfortunately the finding of no significant differences does not legiti- 
mately allow one to conclude that differences in teaching efficacy are 
not related to method and/or class size. Rather, the fact that essentially 
none of the investigations have been able to demonstrate practical 
differences suggests that some careful examination of the experimental 
methodology and evaluation procedures may be in order. 14 

It will be noted here that a direction is being suggested in which the 
methodology of research rather than the conceptualization of the research 
problem is considered the most fruitful line of further investigation. 

The preoccupation with methodology is suggested again in the same 
year by Nachman and Opochinsky. 

Reviews of teaching research have consistently concluded that different 
teaching procedures produce little or no difference in the amount of 
knowledge gained by the students. This same conclusion has been reached 
despite the fact that experimenters have employed a wide variety of 
independent variables, such as lecture versus discussion classes, instruc-
tor-centered vs. student-centered classes, large versus small classes, var-
ious types of TV classes, etc. These results are surprising if one considers
that much of the research was instigated by the hypothesis that differ-
ences would be found. Furthermore, it appears as if most educators still
assume that classroom techniques do in fact have specific effects. Why
then have differences not been found?

The purpose of this paper is to examine an alternative hypothesis, namely,
that the different teaching methods have, in fact, produced differential
amounts of learning but that these effects have been masked in the
measurement process. 15

13 W. J. McKeachie, “Student-centered versus instructor-centered instruction,”
Journal of Educational Psychology, 45, (1954). McKeachie went on to suggest some
of the reasons why the research results have been inconclusive.

14 Bureau of Institutional Research, “A Review of the Literature Concerning
Studies of College Teaching Methods and Class size,” University of Minnesota, 1958,
p. 4. (mimeographed)



We started in 1924 and we end with a quotation from the widely used 
Handbook of Research on Teaching published in 1963, almost 40 years 
after the initial quotation. 



The effectiveness of lecture and discussion methods has often been com- 
pared. Since discussion offers the opportunity for a good deal of student 
activity and feedback, it could, in theory, be more effective than the 
lecture method in developing concepts and problem-solving skills. How- 
ever, since the rate of transmission of information is slow in discussion 
classes, we would expect lecture classes to be superior in helping students 
acquire knowledge, (p. 1126) 

Despite the many findings of no significant differences in effectiveness 
between lecture and discussion, those studies which have found differ- 
ences make surprisingly good sense. In only two studies was one method 
superior to the other on a measure of knowledge of subject matter; both 
studies favored the lecture method. In all six experiments finding signifi- 
cant differences favoring discussion over lecture, the measures were 
other than final examinations testing knowledge . . . 

When one is asked whether lecture is better than discussion, the appro- 
priate counter would seem to be, “For what goals?” 16 



We have here, of course, reviewed many more studies than those 
covered by Professor McKeachie in his analysis in the Handbook. Pro- 
fessor McKeachie may be quite right in suggesting that the rare cases 
where differences favor one method of teaching over another provide us 
with possible clues as to why. It is clear, however, that McKeachie prefers
the discussion method, a preference based upon the work of researchers like
N. R. F. Maier, who have focused their analytical attention solely on discussion
groups, rather than one grounded in comparative studies of two or more
teaching methods. 17



15 M. Nachman and S. Opochinsky, “The effects of different teaching methods:
A methodological study,” Journal of Educational Psychology, 49, No. 5, (1958), pp.
245-249.

16 W. J. McKeachie, “Research on teaching at the college and university level,”
in N. L. Gage (Ed.), Handbook of Research on Teaching (Chicago: Rand McNally
& Co., 1963) pp. 1126-1127.

17 See W. J. McKeachie, “The discussion group,” Memo to the Faculty, No. 14. 
(Ann Arbor, Michigan: Center for Research on Learning and Teaching, 1965). 
The purpose of this brief review is not to prove how stupid, or narrow-
minded, or ideologically committed have been the researchers who have
compared college teaching methods. We simply want to demonstrate that 
the data are overwhelming in the direction of no differences among various 
methods of college instruction. In the next chapter we make one last mas- 
sive effort to demonstrate this all over again. We hope our demonstration, 
because it reanalyzes the data rather than the conclusions of the various 
studies, will be all that much more convincing than have previous sum- 
maries. We are convinced that approximately 40 years of research speaks 
the truth. It is now time to turn to a reconceptualization of the analytical 
problem. 

We can no longer be satisfied that there are pedagogical theories that 
confirm and predict the advantage of one teaching method over another. 
We are now convinced that the proper conceptualization of the problem, 
already foreshadowed in the Guetzkow, Kelly, McKeachie summary, is 
to build a model or models of the learning-teaching processes in which 
pedagogy is only one input into the process, although admittedly a 
complex one. 






CHAPTER 3

There Are No Differences



Academic policies regarding the use of various methods of teaching 
at the college level should be grounded in research. The data exist upon 
which to base grounded policy decisions. In this chapter we present the 
summary conclusions setting forth the results of comparative analyses of 
college teaching methods. The methods utilized in obtaining these results 
are presented in APPENDIX A, and the detailed results not reported in this 
chapter are contained in APPENDIX B. 

How It Was Done 

All previous summaries of the results of individual comparative studies 
added up the conclusions of separate researchers. The output of such 
summaries was, therefore, a set of conclusions about conclusions. 

We did not follow this procedure. Our analysis is based upon the actual 
data contained in the individual comparative studies. It is our belief that 
sometimes researchers mistake their own data and reach faulty conclu- 
sions. We wished to avoid this source of error. 

The data utilized fell into two categories. In a number of studies (18),
the author only reported the mean performance scores of groups taught 
by different methods. These data could only be used to determine that one 

method of instruction produced better examination results than another, 
or that there were no differences measured. These studies, therefore, re- 
ported only the sign of the differences. The magnitude of the differences 
was unimportant. Such data are the least precise, but provide us with the 
opportunity to make a sign test (over many studies, is one method of 
instruction favored over another regardless of the amount of measured
difference in examination scores?).
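
The logic of a sign test over many studies can be sketched as follows. This is only an illustration under stated assumptions, not the procedure of APPENDIX A: we assume an exact two-sided binomial test with ties (no measured difference) set aside, and the function name `sign_test` and the tallies shown are hypothetical, not figures drawn from the studies reviewed.

```python
from math import comb


def sign_test(n_favor_a: int, n_favor_b: int) -> float:
    """Exact two-sided sign test p-value.

    n_favor_a and n_favor_b count the comparisons whose difference
    favored method A or method B; ties are excluded before calling.
    Under the null hypothesis each sign is equally likely.
    """
    n = n_favor_a + n_favor_b
    if n == 0:
        return 1.0
    k = min(n_favor_a, n_favor_b)
    # Probability of k or fewer successes under Binomial(n, 0.5),
    # doubled for a two-sided test and capped at 1.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical signs: "+" favors method A, "-" favors method B, "0" is a tie.
signs = ["+", "-", "+", "0", "-", "+", "-", "-", "+", "+"]
favor_a = signs.count("+")
favor_b = signs.count("-")
p_value = sign_test(favor_a, favor_b)
print(favor_a, favor_b, p_value)
```

Only the direction of each difference enters the calculation, which is why studies reporting nothing but group means can still be included.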

Where data on the distribution of scores (means, standard deviations, 
sample sizes) for the groups compared were available, we were able to 
use or calculate a standardized statistical measure of the differences in all 
possible mean examination scores. This measure provided high precision 
in determining the actual amount of difference between groups’ scores in
each comparison.
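
A standardized difference of this kind might be computed as in the sketch below. The particular measure is an assumption made for illustration: we divide the difference in means by a pooled standard deviation, which is one common choice, and the measure actually used is the one described in APPENDIX A. The function name and the scores are hypothetical.

```python
from math import sqrt


def standardized_difference(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Difference in group means expressed in pooled-standard-deviation units.

    Assumed form (not confirmed by the source): the two sample variances are
    pooled with weights (n - 1), and the mean difference is divided by the
    square root of the pooled variance.
    """
    pooled_var = ((n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2) / (n_a + n_b - 2)
    return (mean_a - mean_b) / sqrt(pooled_var)


# Hypothetical final examination results for a lecture and a discussion section.
d = standardized_difference(75.0, 10.0, 30, 72.0, 10.0, 30)
print(round(d, 3))
```

Because the result is expressed in standard deviation units, differences from studies with different examinations and scoring scales can be placed in a single distribution.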

This identical procedure was then repeated for all the studies where 
sufficient data were available, employing only the independent compari- 
sons. In many studies a number of comparisons were made in which the 
same groups were compared several times. Thus, results on midterm
examinations and final examinations might be reported. We limited inde- 
pendent comparisons to a single comparison, usually the final examination 
results, between two groups of students. Thus, even though comparisons 
between the same two groups were reported in a study for midterm and 
final examinations or for term papers and examinations, we chose only 
the final examination results as being independent. We thus eliminated 
repeated comparisons of the same two student groups, which obviously are 
not independent. 
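
The selection rule can be sketched as follows. The record layout, the field names, and the fallback of keeping the first reported comparison when no final examination exists are all assumptions; the text says only that the final examination was "usually" the one chosen.

```python
from typing import NamedTuple


class Comparison(NamedTuple):
    study: str      # study identifier (hypothetical labels below)
    group_a: str    # teaching method of the first group
    group_b: str    # teaching method of the second group
    measure: str    # e.g. "midterm", "final", "term paper"
    diff: float     # measured difference, group_a minus group_b


def independent_comparisons(comparisons, preferred="final"):
    """Keep one comparison per pair of groups within each study.

    The preferred measure wins when it is reported; otherwise the
    first comparison seen for that pair is kept (an assumed fallback).
    """
    chosen = {}
    for c in comparisons:
        # The same two groups form one key regardless of their order.
        key = (c.study, frozenset((c.group_a, c.group_b)))
        current = chosen.get(key)
        if current is None or (
            c.measure == preferred and current.measure != preferred
        ):
            chosen[key] = c
    return list(chosen.values())
```

Applied to a study that reports midterm, final, and term paper results for the same two groups, this keeps only the final examination result, so each pair of groups contributes a single case.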

In this chapter each chart and the accompanying statistics report the 
results of the independent comparisons. Each chart shows the actual 
distribution of standardized differences and reports the statistics for the 
distribution as well as the sign test for the independent comparisons. The 
footnote to each chart reports the sign tests for all differences measured. 

We thus have four different analyses of the data. 

(1) The sign test of all differences measured, representing the greatest 
number of possible comparisons. 

(2) The distribution of standardized differences in all measured group 
examination scores and associated statistics, which had the next largest 
number of possible comparisons. 

(3) The sign test of those comparisons in all of the studies which were 
considered independent comparisons, with a smaller number of cases than 
the previous two categories. 

(4) The distribution of standardized differences in measured group 
examination scores and associated statistics for independent comparisons 
only, the smallest number of comparisons utilized. 

Briefly then, our method was to use examination data employing objec- 
tive measures of the differences obtained in group examination scores. 
We proceeded from the least precise objective measure to the most precise 
objective measure, ending up with four distinctive ways to measure the 
differences in student performance for every comparison analyzed. 

The entire analysis involved reading every study carefully, abstracting 
the data from it, translating the data to a form capable of computer proc- 
essing, and designing computer programs to analyze the results. 1 

It Pays to Study 

The most obvious comparison that may be made in determining whether 
anything makes a difference in examination performance by college stu- 
dents is to contrast a group of students who studied with those who did 
not. Throughout the entire period of his schooling the student is reminded 
of the desirability of studying. The terms “school” and “study” almost 
become synonymous. Belief in this identity is often shared by schoolmen. 
This belief is so strong that few functionaries of schools, or researchers 
on them, have ever done research to confirm or challenge the identity. 

We found two studies in the literature which compared some form of 
study with no study and evaluated their respective outcomes on examina- 
tions covering ability to recall or prove knowledge of course content. 
These studies had a total of six comparisons between groups of students 
who studied and those who did not, all of which were independent com- 
parisons. The results are significantly in favor of study. 2 
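
As a rough check on what "significantly" can mean with so few cases, suppose (an assumption; the text does not name the test applied here) that the six comparisons are treated as a sign test in which each direction has probability 1/2. With all six favoring study, the tail probabilities are:

```latex
P(X = 6 \mid n = 6,\ p = \tfrac{1}{2}) = \left(\tfrac{1}{2}\right)^{6} = \tfrac{1}{64} \approx 0.016,
\qquad
p_{\text{two-sided}} = 2 \times \tfrac{1}{64} = \tfrac{1}{32} \approx 0.031 .
```

Under that assumption, a unanimous six of six is the only split of six comparisons that reaches the conventional 0.05 level on a two-sided test.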

When it comes to learning a specific subject matter we have some 
modest assurance that for college students studying does make a difference. 
This should be a very comforting conclusion for those who believe that 
a college or university education, and study, are somehow or other 
linked. Whatever there is about the college and university environment 
that makes studying by students a more probable activity than not studying 
must be taken as an important contribution of the higher educational 
institution. Perhaps this may even be its most important contribution: 
to develop the habits of study which are, or may be, the precondition of 
learning. At least we can start with meager data which are definitive in 
establishing the value of study when compared with no study in influencing 
college student grades. 

1 The computer programs utilized and the detailed results are available to any user 
by application to the librarian, Center for the Advanced Study of Educational 
Administration, University of Oregon, Eugene, Oregon 97403. Data will be provided on 
a cost basis. 

2 Studies number 057 and 062 report in every comparison that studying produces 
examination grades far superior to those achieved by students who did not 
study the course content. The studies are listed by number in the bibliography. 

Teaching Method Comparisons 

We have organized our comparisons among teaching methods along a 
scale which at one end emphasizes instructor-centered classroom situa- 
tions, and at the other end emphasizes student-centered learning situations 
which may or may not be found in a classroom setting. Thus, we proceed 
systematically from an examination of lecture only compared to other 
instructional methods, to a comparison of independent study with those 
methods with which it has been contrasted. It will perhaps help to orient 
the reader if we briefly describe the characteristics represented by this 
broad range of teaching-learning situations. 

To reiterate a point made in the first chapter, we believe that teaching 
is a technology. Teaching consists of a set of ideas to be communicated, 
models of the teaching process and implicit assumptions concerning the 
teaching-learning linkage, and a set of instructional activities summed up 
as the “method” of teaching. 

Researchers have been attempting to compare and contrast different 
college-level teaching technologies for approximately 40 years. We may 
classify these technologies along a number of different, yet overlapping, 
dimensions. For example, two dimensions are the extent to which emphasis 
is placed upon the instructional activities of the teacher, or the degree to 
which learning is considered to be a social event for which group behavior 
is important. 

In this chapter, we present the results of experimental comparisons of 
different teaching technologies. In so doing, we shall briefly discuss some 
of the characteristic features of the different technologies which have 
received attention in the research literature. 

FACE-TO-FACE INSTRUCTION 

Face-to-face methods of teaching such as the lecture, group discussion, 
and the tutorial are explicit examples of teaching technologies based on 
traditional assumptions concerning the teaching-learning linkage. The 
instructor is assumed to be not only a sufficient condition but also a 
necessary condition for learning, and outputs from the teaching-learning 
situation are assumed to be a function of differential teaching inputs. 





